Dr3: Value-Based Deep Rl Requires Explicit Regularization